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ABSTRACT 

Data  fusion  is  an  increasingly  important  capability  in 
operational  command  and  control  systems.  A  fundamental 
function  of  data  fusion  processing  associates  unidentified 
contact  reports  from  multiple  sensors  to  a  common  target,  so 
that  the  commander  has  a  consistent  tactical  picture  from 
which  to  make  decisions.  Clearly  computer  simulations  should 
model  the  data  fusion  aspect  of  the  command  and  control 
process.  However,  since  data  fusion  algorithms  are  typically 
computationally  intensive,  they  have  been  neglected  in  tliese 
simulations. 

Tliis  paper  describes  a  new  approach  for  low 
resolution  simulation  of  data  fusion  processing.  Operational 
data  fusion  svstems  are  evaluated  on  known  data  sets  using 
four  performance  metrics  calculated  from  the  confusion 
matrix — a  tabulation  of  the  nimiber  of  contact  reports  from 
each  ground  truth  track  assigned  to  each  constructed  track. 
The  simulation  algoritlim  described  in  this  paper  makes 
probabilistic  data  fusion  decisions  based  on  values  input  for 
tltese  metrics.  This  paper  also  describes  an  object-oriented 
implementation  of  the  algoritlun  in  C++  and  presents 
numerical  results  from  a  simulation  study  of  tlie  algoritlun 
itself,  showing  its  ability  to  produce  data  fusion  results  \vitli  tlie 
input  performance  characteristics. 

INTRODUCTION 

In  a  report  to  Congress,  simunarized  by  Tolcliin 
(1989),  the  Departments  of  Defense  and  Energy  identified  data 
flision  as  one  of  tlie  22  tecluiologies  critical  to  national  security 


and  the  long-term  qualitative  superiority  of  U.S.  weapons 
systems.  The  directors  of  the  Department  of  Defense  (DOD) 
command  and  control  (C^)  research  and  development  centers 
have  formed  a  group  to  coordinate  data  fiision  issues  among 
the  services.  This  group  has  defined  data  fusion  as:  '‘A  process 
that  deals  with  tlie  association,  correlation,  and  combination  of 
data  and  information  from  single  and  multiple  sources  to 
acliieve  refined  position  and  identity  estimates,  and  complete 
and  timely  assessments  of  situations  and  threats  as  well  as  their 
significance.’’  (Wliite  1991)  Data  fusion  is  an  aspect  of 
virtually  all  C^  systems  in  one  way  or  another. 

The  list  also  identified  computer  simulation  and 
modeling  as  a  critical  technology.  Computers  have  long  been 
used  for  a  variety  of  defense-related  simulation  purposes, 
including  tactics  development  and  assessment,  weapons 
requirements  and  design,  and  operational  training.  However, 
recent  increases  in  computer  processing  speed,  the 
development  of  high-speed,  high-capacity  networks,  and  tlie 
emergence  of  object-oriented  programming  languages,  such  as 
C++,  offer  tlie  opportunity  for  even  more  sophisticated 
simulations.  Large-scale  simulations  will  be  put  togetlier  from 
a  menu  of  object  models  developed  and  maintained  by 
geographically  separated  organizations  witli  expertise  in 
specific  areas.  These  object  models  undoubtedly  will  be 
developed  at  variable  resolutions.  For  example,  high  resolution 
models  for  sensor  design  studies  may  include  detailed 
environmental  data  and  solve  complicated  propagation 
equations,  while  low  resolution  models  appropriate  for 
campaign-level  simulations  may  only  use  the  simple  ‘'cookie 
cutter”  detection  approach.  Davis  and  Huber  (1992)  discuss  a 
number  of  issues  related  to  variable  resolution  modeling  and 
conclude  tliat  liierarcliical  models  provide  the  best  approach. 

This  paper  presents  work  supporting  an  Office  of 
Naval  Research  task  to  investigate  liigh  performance 
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computing  for  modeling.  The  goal  was  to  develop  data 
fusion  object  models  at  several  resolutions  and  use  them  for 
analysis  and  benchmark  performance  testing  on  a  variety  of 
single  processor  and  multiple  processor  platforms.  This  paper 
proposes  a  unique,  low  resolution  data  fusion  simulation 
algorithm,  describes  its  implementation  in  a  C++  class 
hierarchy  that  could  support  data  fusion  simulations  at  other 
resolutions,  and  provides  numerical  results  of  a  simulation 
study  of  the  new  algorithm  itself 

DATA  FUSION  ALGORITHMS  AND  THEIR 
EVALUATION 

The  DOD  data  fiision  group  (White  1991)  has 
defined  four  levels  of  data  fusion  processing.  Level  1 
processing  is  characterized  by  data  alignment,  data/object 
correlation,  and  position  and  identity  estimation.  The  goal  of 
tliis  level  is  a  complete  and  accurate  representation  of  the 
targets  in  a  tactical  scene.  Level  2  processes  use  the  tactical 
picture  and  attempt  to  explain  the  situation.  Level  3  processes 
interpret  the  situation  in  terms  of  the  threat  to  friendly  forces. 
Finally,  Level  4  processes  monitor  the  previous  three  levels  to 
evaluate  the  effectiveness  of  the  surveillance  and  data  fusion 
processing  for  the  purpose  of  resource  management. 

Most  data  fusion  development  efforts  have  focused  on 
the  Level  1  problem,  wliich  historically  has  been  called  report- 
to-track  or  track-to-track  correlation.  Simply  stated,  the  goal  is 
to  construct  target  tracks  from  incoming  contact  reports  such 
that  each  constructed  track  (CT)  contains  reports  from  one  and 
only  one  ground  truth  track  (GT)  and  each  GT  is  represented 
by  one  and  only  one  CT. 

Kendall,  Stuart,  and  Ord  (1983)  describe  two  types  of 
statistical  classifrcation  procedures  that  have  been  applied  to 
tlie  Level  1  problem  when  the  contact  reports  contain 
numerical  data.  In  the  situations  with  a  training  data  set 
(previously  identified  contact  reports  from  each  GT), 
discrimination  procedures  develop  a  classification  rule  tliat 
identifies  new,  unidentified  contact  reports  as  one  of  tlie  knowm 
GT’s.  In  other  situations,  clustering  teclmiques  divide  contact 
reports  into  CT’s  that  are  as  distinct  as  possible.  In 
discrimination  the  number  of  GT’s  is  known,  wliile  in 
clustering  it  is  to  be  determined.  Batch  and  sequential 
procedures  have  been  developed  for  both  discrimination  and 
clustering.  Most  report-to-track  correlation  algorithms  are 
sequential  clustering  procedures. 

Performance  evaluation  of  statistical  data  fusion 
algoritlims  is  an  important  aspect  of  their  development.  For 
discrimination  metliods,  well-founded  teclmiques,  such  as 
crossvalidation,  can  be  used  to  assess  overall  classification 


performance  by  estimating  error  rates.  However,  evaluating 
clustering  performance  is  not  as  straightforward.  Typically 
these  algorithms  have  parameters  that  can  be  adjusted  to 
optimize  some  aspect  of  performance,  for  example,  number  of 
CT’s  created.  In  this  case  there  are  several  aspects  to 
performance  and  a  variety  of  measures  is  necessary.  Most  are 
calculated  from  the  '‘soK:alled”  confusion  matrix,  which  is  a 
common  basis  for  evaluating  classification  procedures  on 
identified  data  sets.  In  the  data  fusion  context,  tlie  confusion 
matrix  has  one  column  for  each  GT  and  one  row  for  each  CT. 
The  (i  j)  entries  are  the  number  of  reports  from  the  j^  GT  the 
algorithm  assigns  to  the  i^  CT. 

Fox,  Mills,  Thode,  Crespo,  and  Mendigucliia  (1991) 
define  four  performance  measures  that  correspond  to  tlie 
possible  decisions  a  Level  1  data  fusion  algorithm  can  make  for 
each  contact  report:  correct  assigrmient,  mis-assigimient, 
fragmentation,  and  ambiguity.  An  algorithm  makes  a  correct 
assignment  when  it  puts  the  contact  report  in  a  CT  that 
represents  the  correct  GT.  An  algoritlim  makes  a  mis- 
assignment  when  it  puts  the  contact  report  in  a  CT  that  is 
represents  an  incorrect  GT.  For  evaluation  purposes,  the 
‘'correct  GT’  for  a  CT  is  assumed  to  be  the  one  vvitli  the  most 
contact  reports  assigned,  even  though  that  might  not  be  the  GT 
tliat  initiated  the  CT.  Note  that  a  single  GT  may  be 
represented  by  more  than  one  CT,  but  that  a  CT  will  represent 
only  one  GT.  (Ties  need  to  be  arbitrarily  decided.) 
Fragmentation  occurs  when  the  algorithm  creates  a  new  CT  for 
a  particular  GT  when  there  is  already  an  existing  CT 
established  for  the  GT.  The  contact  creating  tlie  extra  track  is 
counted  as  a  fragmentation  error,  but  subsequent  contacts  in 
that  CT  are  considered  correct  decisions,  as  long  as  the  largest 
number  of  assigned  reports  come  from  that  GT.  An  algoritlim 
makes  an  ambiguous  assignment  when  it  assigns  the  contact 
report  to  no  CT.  For  ease  of  evaluation,  the  set  of  ambiguous 
contacts  report  is  usually  identified  by  a  single  pseudo-CT,  e.g., 
CT  0.  The  performance  measmes  in  Fox  et  al  (1991)  are  the 
ratios  of  niuiiber  of  contacts  assigned  in  each  decision  category 
to  total  number  of  contacts  processed. 

To  calculate  these  perfoniiance  measures,  let  N  be  the 
total  number  of  contacts  processed,  Na  <  N  be  the  number  that 
are  ambiguous,  #GT  be  the  number  of  ground  truth  tracks  (= 
number  of  columns  in  the  confusion  matrix),  and  #CT  tlie 
number  of  constructed  tracks  (=  number  of  row^s).  Then, 

2(row:  maximum)  -  niax(#CT-#GT,0) 

car  = - -  (1) 
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where  car  is  correct  assignment  ratio,  mar  is  mis-assignment 
ratio,  >  is  fragmentation  ratio,  and  ar  is  ambiguity  ratio.  The 
summation  in  equations  (1)  and  (2)  totals  the  largest  entries  in 
each  row.  The  maximum  function  in  equations  (1)  and  (3) 
handles  the  case  of  fewer  constructed  tracks  than  ground  trutla 
tracks.  It  is  clear  that  these  performance  metrics  add  to  1  and 
hence  consider  the  range  of  algorithm  decisions.  However, 
their  ability  to  measure  the  quality  of  data  fusion  algorithm 
decisions  is  less  apparent.  In  the  extreme,  perfect  data  fusion 
performance  corresponds  to  a  square  confusion  matrix  with  all 
tlie  non-zero  entries  along  a  diagonal;  in  tills  case  we  can  easily 
see  that  the  correct  assignment  ratio  is  1  and  the  other  metrics 
are  0.  Fox  et  al.  (1991)  discuss  these  and  other  performance 
metrics  in  more  detail  and  provide  some  numerical  examples. 

A  DATA  FUSION  SIMULATION  ALGORITHM 
USING  CONFUSION  MATRIX  METRICS 

Data  fusion  might  be  an  aspect  of  computer 
simulation  studies  for  a  variety  of  reasons.  First,  in  a  campaign 
simulation,  tactical  decisions  depend  on  the  state  of  the 
scenario,  which  is  knoiim  and  input  by  die  model  user.  The 
simulation  will  have  added  realism  if  the  decisions  are  based 
on  a  perceived  tactical  siuiation,  not  the  actual  ground  trudi 
situation.  The  perceived  situation  will,  of  course,  be  different 
from  the  actual  one.  Data  fusion  algoridims  (as  a  key  part  of 
C"  s}'stems)  are  the  mechanism  by  which  tacdcal  conuiianders 
get  their  view  of  the  operational  situation.  Second,  the  effect  of 
data  fusion  performance  on  die  tacdcal  outcome  might  be  of 
interest  in  C"  studies.  Finally,  die  siniuladon  might  be  focused 


on  the  data  fusion  process  itself,  and  different  algorithmic 
approaches  evaluated  in  a  number  of  scenarios. 

Since  actual  data  fusion  algorithms  are  not  required 
in  the  first  two  modeling  situadons,  a  low  level  resoludon 
approach  can  be  used.  That  is,  in  these  cases  the  data  fusion 
processing  can  be  simulated  probabilisitcally  since  ground  trudi 
is  known.  The  algorithm  described  here  uses  the  performance 
metrics  cor,  mar,  fir,  and  ar  as  simuladon  parameters 
represendng  the  average  data  fusion  performance  desired.  So, 
the  first  step  is  to  assign  values  between  0  and  1  to  each  of 
these  four  parameters,  such  that  their  sum  is  1.  Simulated  data 
fusion  processing  of  a  contact  report  begins  by  generadng  a 
random  number  u  uniformly  on  [0,1].  If  0  <  u  <  car,  the 
algorithm  makes  a  correct  assignment  decision;  if  car  <  u  <  car 
+  mar,  a  mis-assignment  decision;  if  car  +  mar  <  u  <  car  + 
mar  +  fi%  a  fragmentadon  decision;  and  an  ambiguity  decision 
otherwise. 

While  the  general  implementadon  of  the  four 
decisions  is  straightforward,  a  number  of  special  cases  need  to 
be  recognized  and  handled  by  the  data  flision  simuladon 
algorithm.  Table  1  defines  the  five  possible  cases.  The  GT 
column  indicates  whedier  the  GT  in  the  current  contact  report 
has  already  been  processed;  the  CT  column  denotes  whether 
there  already  is  a  constructed  track  associated  witli  this  GT  (if 
an  old  GT)  or  whedier  there  are  any  CT’s  at  all  (if  new  a  GT); 
and  the  MGT  column  wliether  diere  is  more  than  one  GT  with 
associated  CT’s.  The  reasons  for  these  particular  columns  will 
become  apparent  as  we  discuss  die  cases.  The  next  set  of 
columns  in  Table  1  provide  die  data  fusion  acdon  taken  in  each 
of  the  five  cases  for  possible  values  of  u. 

Case  1  is  die  general  case  in  which  all  four  decisions 
can  be  implemented.  In  case  2  only  one  GT  has  associated 
CT(s),  although  more  dian  one  GT  may  have  been  processed. 


Table  1.  Simulated  Data  Fusion  Algorithm  Processing  Cases 


Cases 

Data  Fusion  Decision 

Num  GT  CT  MGT' 

0  <  u  <  cor  car  <  u  <  car  +  car  +  war  <  u  <  car  +  mar  +  fr  < 

mar  cor  +  mar  +  fr  u  <  1 

1  yes  yes"  yes 

2  yes  yes"  no 

3  yes  no"  — 

4  no  yes^  — 

5  no  no^  — 

correct  assign  mis-assign  fragment  ambiguous 

correct  assign  fragment  ambiguous 

create  CT  mis-assign  ambiguous 

create  CT  mis-assign  ambiguous 

create  first  CT 

‘Multiple  GT’s  with  associated  CT’s  'Associated  to  GT  ^Any  CT,  associated  or  not 


In  case  2,  niis-assignments  are  not  possible  and  are  combined 
with  fragmentation  actions.  Case  3  represents  the  situation  in 
which  the  GT  has  been  seen  before,  but  no  CT  has  been  created 
for  it  because  the  previous  contacts  reports  were  erroneously 
processed.  In  this  case  fragmentation  actions  are  not  possible 
and  are  grouped  with  mis-assignments.  Case  4  represents  die 
situation  in  which  the  GT  has  not  been  seen  before,  but  there 
are  existing  CT’s.  This  case,  as  with  case  3,  does  not  allow 
fragmentation  actions,  which  are  again  combined  with  mis- 
assignments.  In  cases  3  and  4,  the  correct  assignment  action  is 
to  create  a  new  track.  Finally,  case  5  is  the  special  case  of  the 
first  contact  report  processed.  No  matter  what  values  of  the 
confusion  matrix  metrics  are  specified,  the  decision  is  always  to 
create  the  first  track  with  the  first  contact  report. 

Pseudo-code  for  the  simulated  data  fusion  algorithm 
is  provided  in  Figure  1.  It  is  a  set  of  nested  if-then-else 
constructions  to  identify  and  process  the  five  cases  described  in 
Table  1.  Required  processing  actions  are  indicated  by  boldface 
type.  The  code  in  Figure  1  reflects  two  arbitrary  decisions.  In 
die  case  of  a  correct  assignment  decision,  there  may  be  more 
dian  one  CT  to  which  the  contact  report  could  be  assigned. 
The  convendon  applied  is  to  assign  it  to  the  CT  created  most 
recendy,  represented  by  the  highest  CT  number.  This  tends  to 
somewhat  reflect  how  actual  data  fusion  algoridims  behave. 
The  convention  could  just  as  easily  been  to  assign  it  to  the  first 
CT  created  or  randomly  to  one  of  the  CT’s  associated  ivith  the 
GT.  For  mis-assignment  decisions,  diere  is  usually  more  than 
one  CT  to  wliich  the  contact  report  can  be  mis-assigned.  The 
algorithm  randomly  selects  one  of  these  CT’s.  Bodi  of  these 
design  decisions  do  not  affect  die  values  of  confusion  matrix 
performance  metrics  calculated  from  the  results  of  die 
simulated  data  fusion  algoridim. 

OBJECT-ORIENTED  MODELING  FOR  DATA 
FUSION 

Even  though  die  data  fusion  simulation  algoridim 
described  in  the  previous  section  is  quite  simple,  as  data  fusion 
algoridims  go,  it  can  nonedieless  be  implemented  using  die 
object-oriented  paradigm.  This  structure  can  be  used  for  more 
complicated  algoridims  as  well.  This  section  is  not  meant  to 
present  a  rigorous  object-oriented  design  niediodology,  but 
rather  discuss  some  of  the  issues  involved. 

The  first  step  is  a  description  of  the  problem.  In  diis 
case,  we  need  a  (simulated)  data  fusion  algorithm  diat  assigns 
contact  reports  to  tracks.  So,  our  first  breakdown  of  required 
objects  would  be:  (1)  data  fusion  algoriduii;  (2)  report;  and  (3) 
track.  A  track  object  should  contain  the  reports  assigned  to  it. 
Hence,  we  need  a  fourth  object,  a  list  of  reports.  Finally,  since 


Assign  performance  values:  car^  mar^fr^  ar 
Compute  random  number  u  from  [0,1] 
if  (existing  ground  truth  (GT))  { 
if  (existing  constructed  track  (CT)  for  this  GT)  { 
if  (more  than  one  GT  for  set  of  CT’s)  { 
if  (u  <  car)  Correctly  assign  to  latest  existing  CT 
else  if  (u  <  car+mar)  Incorrectly  assign  to  random 
CT 

else  if  (u  <  car+niai^fr)  Incorrectly  create  new  CT 
else  Report  is  ambiguous 

} 

else  { 

//  Only  one  GT  for  set  of  CT’s 
if  (u  <  cor)  Correctly  assign  to  latest  existing  CT 
else  if  (u  <  cat^rnai^fr)  Incorrectly  create  new  CT 
else  Rei)ort  is  ambiguous 

} 

} 

else  { 

//  No  existing  CT  for  this  GT 
if  (u  <  car)  Correctly  create  new  CT 
else  if  (u  <  car+niar-^fr)  Incorrectly  assign  to  random 
CT 

else  Report  is  ambiguous 

} 

else  { 

//  New  GT 
Add  to  GT  list 
if  (At  least  one  CT)  { 
if  (u  <  car)  Correctly  create  new  CT 
else  if  (u  <  car+nior+fr)  Incorrectly  assign  to  random 
CT 

else  Report  is  ambiguous 

} 

else 

//No  CT’s 

Correctly  create  first  track 

}  _ 

Figure  1.  Pseudo-Code  for  Simulated  Data  Fusion 
Algorithm 

we  expect  to  deal  with  more  than  one  track,  we  need  a  list  of 
tracks  as  the  fifth  object. 

A  rough  view  of  how  tliese  objects  might  interact  is 
showm  in  Figure  2.  A  Report  object  is  read  by  a  Data_Fusion 
object  which  processes  tlie  Report  against  the  Track_List  (i.e., 
data  base)  and  assigns  the  Report  to  a  specific  Track  object  in 
the  Track_List  or  creates  a  new  Track  in  the  Track_List.  The 
Track  object  maintains  a  Report_List  object  containing  the 
Reports  assigned  to  that  Track. 


Figure  2.  Data  Fusion  Object  Interactions 


Figure  3  shows  C++  class  declarations  for  these 
objects.  Note  that  tliese  represent  just  a  bare  minimum  of  data 
members  and  member  fimctions  to  illustrate  the  idea.  In 
addition,  for  clarity'  all  declarations  are  public.  Data_Fusion  is 
a  base  class  for  the  DF_Simulation  class.  Otlier  derived  classes 
in  tire  liierarchy  could  represent  different  algorithmic 
approaches  to  data  fusion.  The  Data_Fusion  base  class 
essentially  only  manages  the  track  data  base  for  any  derived 
class  through  the  Track_List  data  member  trklst.  Tills  base 
class  also  defines  a  virtual  function  named  process,  which  each 
derived  class  will  re-define  to  implement  specific  data  flision 
algorithms.  Hence,  process  takes  a  Report  reference  as  an 
argument  and  returns  the  track  number  index  to  which  tlie 
algorithm  assigns  the  report.  The  DF_Simulation  class 
implements  tlie  data  fusion  simulation  algoritlim  discussed  in 
the  previous  section.  Hence,  tills  class  has  four  data  members 
for  tlie  confusion  matrix  perfomiance  metrics:  correct 
assignment  ratio,  etc.  Its  member  fimction  redefines  tlie  virtual 
function  process. 

The  Track_List  class  manages  a  list  of  Tracks.  Its 
data  members  reflect  the  number  of  tracks  in  the  list  and  tlie 
array  of  Track  pointers.  Member  flinctioiis  allow  reading  tlie 
Track_List  object  from  and  writing  it  to  a  file.  The  o\-erloaded 
array  reference  operator  ([  ])  allows  access  to  a  specific  Track 
in  the  list.  The  overloaded  additixe  assignment  operator  (+=) 
allow's  adding  a  nexv  Track  to  tlie  list,  and  the  remove  function 
allows  deleting  a  Track  from  tlie  list  by  specifying  its  index 
number.  The  Track  class  has  a  data  member  track_nuniber  for 
identification  purposes  and  a  Report__List  data  member  for 
managing  the  Reports  assigned  to  tlie  Track  by  tlie 
Data_Fusioii  object.  The  Report^List  class  is  analogous  to  tlie 
Track_List  class.  Finally,  the  Report  class  has  two  basic  data 
members,  the  ground  taith  track  number  and  the  constmcted 
track  number.  Ceruiinly  in  actual  applications,  additional  data 
members  and  member  functions  would  be  added  to  these 


class  Data_Fusion  { 
public: 

Track_List  trklst; 

virtual  int  process  (const  Report&)  (return  0;} 

}; 

class  DF_Simulation  :  public  Data_Fusion  { 
public: 

double  correct_assignment_ratio,  mis_assignment_ratio, 
fragmentation_ratio,  ambiguity_ratio; 
setratios  (const  double&,  const  double&,  const  double&, 
const  double&); 
int  process  (const  Report&); 

}; 

class  Track_List  { 
public: 

int  number_tracks; 

Track  **trk_ptr; 

int  Read  (const  char  ^filename); 

Track&  operator[]  (const  int&); 

Track_List&  operator  +=  (const  Track&); 

void  Remove  (const  int&); 

int  Write  (const  char  *filenanie); 

class  Track  { 
public: 

int  track_nuniber; 

Report_List  rptlst; 

}; 

class  Report_List  { 
public: 

int  nuniber_reports; 

Report  **rptjptr; 

int  Read  (const  char  *filenanie); 

Report&  operator[]  (const  int&); 

Report_List&  operator+=  (const  Report&); 
void  Remove  (const  int&); 
int  Write  (const  char  *filenanie); 

) . 

class  Report  { 
public: 

int  GT_Number;  //  for  evaluation  purposes 

int  CT_Number;  //  assigned  by  data  fusion  algorithm 

K _ _ 

Figure  3.  Data  Fusion  Related  C++  Class  Declarations 
classes  eitlier  directly  or  through  inheritance. 

Figure  4  shows  a  C++  code  fragment  that  implements 
tlie  object  interactions  in  Figure  2  using  object  instances  from 
the  classes  in  Figure  3.  The  code  illustrates  the  basics  of 
object-oriented  progranuiiing  to  instantiate  a  data  fusion  object 
and  use  it  for  processing  Reports.  This  example  certainly  does 


Report  rpt; 

DF_Simulation  good_df_sim; 
int  trkndx; 

good_df_sim.setratios(0.85,  0.075,  0.025,  0.05); 

//  loop  on  reports 

{ 

trkndx  =  good_df_sim.process(rpt); 
if  (trkndx  <=  good_df_sim.trklst.number_tracks) 
good_df_sim.trklst[trkndx].rptlst  +=  rpt; 
else  { 

Track  temp^trk; 

temp_trk.track_number  =  trkndx; 
good__df_sim.trklst  +=  temp_trk; 
good_df_sim.trklst[trkndx].rptlst  +=  rpt; 

} 

rpt.CT_Number  =  trkndx; 


Figure  4.  C++  Code  Fragment  for  Data  Fusion 

not  represent  robust  code,  particularly  since  tlie  function 
definitions  have  not  been  specified.  The  first  line  in  Figure  4 
creates  a  Report  object  named  rpt  that  is  used  to  hold  Report 
values  for  processing.  The  second  line  instantiates  a 
DF  Simulation  object  named  good_df_sim  and  line  four 
assigns  appropriate  values  to  its  data  members.  The  main 
context  of  the  code  fragment  is  a  loop  on  reports  such  Uiat  tiie 
variable  rpt  contains  current  Report  values  during  each 
iteration.  The  good_df_sim  process  function  returns  tlie  Track 
number  index  to  which  tlae  data  fusion  simulation  algoritlim 
assigns  rpt.  If  this  is  less  than  or  equal  to  the  number  of  Tracks 
in  the  Track_List  for  good_df_sim,  tlien  the  Report  is 
appended  to  tlie  Report_List  for  tliat  Track.  If  not,  a  temporary 
Track  is  created  with  the  new  track  number,  added  to 
good_df_sim’s  Track_List,  and  the  Report  appended  to  this 
new  Track's  Report  List.  Finally,  tlie  track  index  number  is 
assigned  to  tlie  current  Report.  The  important  thing  to  note 
about  tliis  example  is  the  simpliciu^  of  tlie  source  code  once  the 
objects  have  been  defined. 

NUMERICAL  RESULTS 

An  object-oriented  version  of  the  data  fusion 
simulation  algorithm  was  implemented  in  Borland  C++  on  a 
DOS  PC  and  numerical  results  were  obtained  for  four  scenarios 
and  six  different  sets  of  performance  parameters.  The 
scenarios  were:  1  track  witli  100  reports;  10  tracks  witli  25 
reports  each;  25  tracks  with  10  reports  eiich;  and  100  tracks 
with  1  report  each.  The  parameter  settings  were  chosen  to 
represent  a  data  fusion  algoritlmi  with  good  performance,  as 
reflected  in  Fox  et  a!.  (1991),  an  algoritlim  witli  mediocre 


performance,  and  four  limiting  cases.  The  algorithm  was  run 
100  times  on  each  of  the  24  scenario-parameter  combinations 
and  a  confusion  matrix  and  performance  metrics  calculated  for 
each  run.  The  performance  metrics  were  then  averaged  over 
the  100  runs,  with  the  results  expected  to  match  the  input 
performance  parameters.  In  the  scenarios  with  multiple  tracks 
and  multiple  reports  per  track,  the  inner  loop  was  on  tracks. 

The  numerical  results  are  summarized  in  Table  2. 
We  see  that  the  simulation  algorithm  reproduces  the  input 
parameters  values  reasonably  well.  In  the  cases  of  good  and 
mediocre  performance  (the  first  two  sections  in  Table  2),  the 
results  are  within  expected  statistical  variation.  For  the 
scenarios  with  one  ground  trutli  track,  there  is  no  possibility  of 
mis-assignment.  For  the  scenarios  with  one  report  per  track, 
there  is  no  possibility  of  track  fragmentation.  In-between  these 


Table  2.  Numerical  Results  From  Simulated  Data 
Fusion  Algorithm 


Scenario 

Performance  Values 

Trks 

Rpts 

car 

mar 

fr 

ar 

1 

100 

.859 

0 

.095 

.047 

10 

25 

.849 

.078 

.024 

.048 

25 

10 

.852 

.077 

.022 

.048 

100 

1 

.849 

.100 

0 

.051 

Actual 

.850 

.075 

.025 

.050 

1 

100 

.504 

0 

.393 

.103 

10 

25 

.501 

.253 

.144 

.102 

25 

10 

.503 

.280 

.119 

.099 

100 

1 

.507 

.400 

0 

.095 

Actual 

.500 

.250 

.150 

.100 

1 

100 

1 

0 

0 

0 

10 

25 

1 

0 

0 

0 

25 

10 

1 

0 

0 

0 

100 

1 

1 

0 

0 

0 

Actual 

1 

0 

0 

0 

1 

100 

.010 

0 

.990 

0 

10 

25 

.184 

.756 

.060 

0 

25 

10 

.110 

.890 

0 

0 

100 

1 

.010 

.990 

0 

0 

Actual 

0 

1 

0 

0 

1 

100 

.010 

0 

.990 

0 

10 

25 

;  .184 

.756 

.060 

0 

25 

10 

.110 

.890 

0 

0 

100 

1 

.010 

.990 

0 

0 

Actual 

0 

0 

1 

0 

1 

100 

.010 

0 

0 

.990 

10 

25 

.004 

0 

0 

.996 

25 

10 

.004 

0 

0 

.996 

100 

1 

.010 

0 

0 

.990 

Actual 

0 

0 

0 

1 

extremes,  the  confiision  matrix  metrics  calculated  from  the 
results  agree  very  well  with  input  values.  For  the  case  of 
perfect  correlation  performance  (actual  the  simulation 

algorithm  performs  flawlessly.  However,  for  the  otlier  limiting 
cases  of  input  parameter  values  (zeros  and  ones),  the  computed 
metrics  do  not  match  the  inputs  as  closely,  but  the  differences 
are  expected.  No  data  fusion  algorithm,  real  or  simulated,  can 
perform  totally  erroneously — it  should  always  get  the  first 
contact  report  correct.  This  does  not  explain  all  the 
discrepancies,  though.  In  the  scenarios  of  10  groimd  truth 
tracks  with  25  contact  reports  each  and  25  tracks  with  10 
reports  each,  the  computed  correct  assignment  percentages  are 
0.184  and  0.110,  respectively,  even  though  perfect  mis- 
assignmenl  or  total  fragmentation  is  input.  Tliis  is  the  aspect 
where  the  simulation  algoritlim  and  the  confusion  matrix 
metrics  diverge  slightly.  The  algoritlim,  in  randomly  mis- 
assigning  reports  to  constructed  tracks,  mis-assigns  more 
incorrect  reports  to  the  track  than  correct  ones.  The  algorithm 
keeps  track  of  which  groimd  trutli  track  is  represented  by  each 
constaicted  track.  The  computation  of  perfomiance  metrics 
from  the  confusion  matrix  assume  the  association  is  based  on 
tlie  most  reports  assigned.  Tliis  is  not  a  serious  discrepancy, 
and  only  affects  results  when  input  performance  parameters  are 
set  at  or  near  the  extremes. 

CONCLUSIONS 

Data  fusion  is  an  important  aspect  of  virtually  all  C" 
si’stems  and  should  be  incorporated  into  computer  simulations 
for  added  realism.  The  probabilistic  algoritlim  for  simulating 
data  fusion  proposed  in  this  paper  can  be  used  as  a  low 
resolution  model  in  these  applications.  The  relatively  simple 
algoritlim  is  based  on  four  confusion  matrix  perfomiance 
metrics  which  represent  the  range  of  possible  decisions  a 
report-to-track  correlation  algoritlim  can  make:  correct 
assigmiient,  niis-assignment,  ffagmentation,  and  ambiguity. 
The  numerical  results  from  the  simulation  study  of  tlie 


algorithm  itself  show  that  it  satisfactorily  reproduces  the  input 
performance  metrics.  Finally,  the  object-oriented 
implementation  outlined  in  this  paper  can  also  serve  as  the 
framework  for  additional  data  fiision  algorithm  models  at  other 
resolution  levels. 
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